AITopics | manipulating image

Fader Networks:Manipulating Images by Sliding Attributes

Neural Information Processing SystemsNov-21-2025, 15:02:52 GMT

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait, or to update the color of some objects. Compared to the state-of-the-art which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and nicely scales to multiple attributes. We present evidence that our model can significantly change the perceived value of the attributes while preserving the naturalness of images.

fader network, manipulating image, name change, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.60)

Add feedback

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Neural Information Processing SystemsNov-20-2025, 23:03:14 GMT

This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word level local discriminators according to input text to classify fine-grained attributes independently. With this discriminator, the generator learns to generate images where only regions that correspond to the given text is modified. Experimental results show that our method outperforms existing methods on CUB and Oxford-102 datasets, and our results were mostly preferred on a user study. Extensive analysis shows that our method is able to effectively disentangle visual attributes and produce pleasing outputs.

manipulating image, name change, text-adaptive generative adversarial network, (4 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)

Add feedback

Reviews: Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Neural Information Processing SystemsOct-8-2024, 05:41:18 GMT

After rebuttal comments: * readability: I trust the authors to update the paper based on my suggestions (as they agreed to in their rebuttal). For AttrGAN, they did change the weight sweep and for SISGAN they used the same hyperparameters as they used in their method (which I would object to in general, but given that the authors took most of their hyperparameters from DCGAN, does not create an unfair advantage). I expect the additional details of the experimental results to be added in the paper (as supplementary material). Ensure that content that is not relevant to the text does not change. Method: to avoid changing too much of the image, use local discriminators that learn the presence of individual visual attributes.

artificial intelligence, discriminator, machine learning, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Reviews: Fader Networks:Manipulating Images by Sliding Attributes

Neural Information Processing SystemsOct-7-2024, 19:40:42 GMT

These attributes are known at training time and for a dataset of faces include aspects like [old vs young], [smiling vs not smiling], etc. They hope to be able to tweak these attributes along a continuous spectrum, even when the labels only occur as binary values. To achieve this they propose an (encoder, decoder) setup where the encoder maps the image x to a latent vector z and then the decoder produces an image taking z, together with the attributes y as inputs. When such a network is trained in the ordinary fashion, the decoder learns to ignore y because z already encodes everything that the network needs to know. To compel the decoder network to use y, the authors propose introducing a adversarial learning framework in which a discriminator D is trained to infer the attributes from z. Thus the encoder must produce representations that are invariant to the attributes y. The writing is clear and any strong researcher should be able to reproduce their results from the presentation here.

fader network, manipulating image, sliding attribute, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

Fader Networks:Manipulating Images by Sliding Attributes

Lample, Guillaume, Zeghidour, Neil, Usunier, Nicolas, Bordes, Antoine, DENOYER, Ludovic, Ranzato, Marc', Aurelio

Neural Information Processing SystemsFeb-14-2020, 18:26:18 GMT

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait, or to update the color of some objects. Compared to the state-of-the-art which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and nicely scales to multiple attributes.

fader network, manipulating image, sliding attribute

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.64)

Add feedback

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Nam, Seonghyeon, Kim, Yunji, Kim, Seon Joo

Neural Information Processing SystemsJan-11-2020, 04:35:32 GMT

This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although existing methods synthesize images having new attributes, they do not fully preserve text-irrelevant contents of the original image. In this paper, we propose the text-adaptive generative adversarial network (TAGAN) to generate semantically manipulated images while preserving text-irrelevant contents. The key to our method is the text-adaptive discriminator that creates word level local discriminators according to input text to classify fine-grained attributes independently.

manipulating image, natural language, text-adaptive generative adversarial network, (2 more...)

Neural Information Processing Systems

Genre: Research Report (0.43)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.65)

Add feedback

Filters

Collaborating Authors

manipulating image

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Fader Networks:Manipulating Images by Sliding Attributes

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Reviews: Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Reviews: Fader Networks:Manipulating Images by Sliding Attributes

Fader Networks:Manipulating Images by Sliding Attributes

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language